Modeling Aqueous Solubility

نویسندگان

  • Darko Butina
  • Joelle M. R. Gola
چکیده

This paper describes the development of an aqueous solubility model based on solubility data from the Syracuse database, calculated octanol-water partition coefficient, and 51 2D molecular descriptors. Two different statistical packages, SIMCA and Cubist, were used and the results were compared. The Cubist model, which comprises a collection of rules, each of which has an associated Multiple Linear Regression model (MLR), gave better overall results on a test set of 640 compounds with an overall squared correlation coefficient of 0.74 and an absolute average error of 0.68 log units. Both training and independent test sets had similar distributions of structures in terms of the different functionalities present-60% neutral, 14% acidic, 8% phenolic, 11% monobasic, 4% polybasic, and 3% zwitterionic molecules. Sets were designed by random selection, with 2688 (81%) and 640 (19%) molecules, respectively, forming the training and the test sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correlation and Prediction of Acid Gases Solubility in Various Aqueous Alkanolamine Solutions Using Electrolyte Cubic Square-Well Equation of State

The object of this work is solubility correlation and prediction of CO2 and H2S in various aqueous alkanolamines using the electrolyte cubic square-well equation of state (eCSW EoS) (Haghtalab, A.,Mazloumi, S. H., (2010), Electrolyte Cubic Square-Well Equation of State for Computation of the Solubility CO2 and H2S in Aqueous MDEA Solutions,  Ind. Eng. Chem. Res.,49,6221-623). The eEoS systemati...

متن کامل

Aqueous Solubility Prediction Based on Weighted Atom Type Counts and Solvent Accessible Surface Areas

In this work, four reliable aqueous solubility models, ASM-ATC (aqueous solubility model based on atom type counts), ASM-ATC-LOGP (aqueous solubility model based on atom type counts and ClogP as an additional descriptor), ASM-SAS (aqueous solubility model based on solvent accessible surface areas), and ASM-SAS-LOGP (aqueous solubility model based on solvent accessible surface areas and ClogP as...

متن کامل

Solubility Prediction of Anthracene in Non-Aqueous Solvent Mixtures Using Jouyban-Acree Model

      A quanitative structure property relationship was proposed to calculate the binary interaction terms of the Jouyban-Acree model using solubility parameter, boiling point, vapour pressure and density of solvents. The applicability of the proposed method for reproducing solubility data of anthracene in binary solvents has been evaluated using 116 solubility data sets collected from the lite...

متن کامل

Accurate Solubility Prediction with Error Bars for Electrolytes: A Machine Learning Approach

Accurate in silico models for predicting aqueous solubility are needed in drug design and discovery and many other areas of chemical research. We present a statistical modeling of aqueous solubility based on measured data, using a Gaussian Process nonlinear regression model (GPsol). We compare our results with those of 14 scientific studies and 6 commercial tools. This shows that the developed ...

متن کامل

Solubility of Cyproterone Derivatives in the Presence of Hydroxypropyl-β-Cyclodextrin: Experimental and Molecular Modeling Studies

      This study presents the influence of hydroxypropyl-β-cyclodextrin (HPBCD) on the aqueous solubility of acyl esters of cyproterone. First, a number of esters of cyproterone were synthesized. Then the phase solubility analysis of the compounds in the presence of HPBCD was investigated in phosphate buffer solution at a pH of 7.4. To gain a better understanding of the complexation mechanism, ...

متن کامل

Investigating the Solubility of CO2 in the Solution of Aqueous K2CO3 Using Wilson-NRF Model

Hot potassium carbonate (PC) solution in comparison with amine solution had a decreased energy of regeneration and a high chemical solubility of . To present vapor and liquid equation (VLE) of this system and predict  solubility, the ion specific non-electrolyte Wilson-NRF local composition model (isNWN) was used in this study; the framework of this model was molecular. Therefore, it was suitab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of chemical information and computer sciences

دوره 43 3  شماره 

صفحات  -

تاریخ انتشار 2003